ENTERED 



RAW SEQUENCE LISTING DATE: 08/12/2002 

PATENT APPLICATION: US/10/098,514 TIME: 15:01:01 

Input Set : A:\PTO.VSK.txt 

Output Set: N:\CFF4\08122002\J098514.raw 

3 <110> APPLICANT: Chang, Sandra P 

4 Christopher, David A 

5 Vine, Benjamin 

6 Su, Wei-Wen 

7 Bugos, Robert

9 <120> TITLE OF INVENTION: PLASMODIUM FALCIPARUM MEROZOITE SURFACE PROTEIN-1 MALARIA PRODUCED IN
10 TRANSGENIC PLANTS

12 <130> FILE REFERENCE: A-71339/RFT/TAL/NBC

14 <140> CURRENT APPLICATION NUMBER: US 10/098,514
C--> 15 <141> CURRENT FILING DATE: 2002-08-06

17 <150> PRIOR APPLICATION NUMBER: US 09/500,376
18 <151> PRIOR FILING DATE: 2000-02-08

20 <150> PRIOR APPLICATION NUMBER: US 60/274,599
21 <151> PRIOR FILING DATE: 2001-03-09

23 <160> NUMBER OF SEQ ID NOS: 19

25 <170> SOFTWARE: PatentIn version 3.1

27 <210> SEQ ID NO: 1
28 <211> LENGTH: 1149
29 <212> TYPE: DNA
30 <213> ORGANISM: Artificial sequence

32 <220> FEATURE:
33 <223> OTHER INFORMATION: synthetic

35 <220> FEATURE:
36 <221> NAME/KEY: sig_peptide
37 <222> LOCATION: (1)..(24)
38 <223> OTHER INFORMATION:

41 <400> SEQUENCE: 1

42 gccgaattcg acaacatcct cagtgacaac atcctcagtg gcttcgagaa cgagtacgac 60

44 gtaatctacc taaagcccct tgccggtgtc taccgttcat tgaagaaaca gatagaaaag 120

46 aatattttca cgttcaacct caacctaaat gacatcctca actcgcgcct caagaagcga 180

48 aaatacttcc tcgacgtgtt ggaatccgac cttatgcaat tcaagcacat tagctctaac 240

50 gagtacatca tagaggacag cttcaagctc ttgaattcag aacagaagaa caccctccta 300

52 aagtcctaca aatacattaa ggagtctgtt gagaacgaca tcaagttcgc ccaggaagga 360

54 attagctact atgagaaagt cctggctaaa tacaaggacg acttggaaag cattaagaag 420

56 gtaatcaaag aagagaagga aaagtttccg agctctccac ccacaactcc cccatcgcct 480

58 gcaaagaccg acgagcagaa aaaagaaagt aagttccttc cattcctcac caacatcgaa 540

60 actctatata acaacctggt gaacaagatt gatgactact taatcaactt gaaggcgaaa 600

62 attaatgact gtaacgtcga aaaggatgaa gcccacgtta agatcaccaa gctttccgat 660

64 ctcaaagcca tcgacgataa gattgacctg tttaagaacc acaacgattt cgacgcaatc 720

66 aaaaagttga tcaacgacga tactaagaaa gacatgcttg gaaaactgct gtcgacaggc 780

68 ttggtccaaa acttcccgaa caccattata agcaagctga tcgaaggaaa gtttcaggat 840

70 atgctgaaca tctctcagca tcaatgcgtg aagaagcaat gtcccgagaa ttcaggttgc 900

72 ttccgccact tagacgaaag ggaggaatgt aaatgcctgc tgaattataa acaggaagga 960

74 gacaagtgcg tagagaatcc taacccaacc tgtaacgaaa ataacggtgg ctgcgatgct 1020

76 gacgctaagt gtaccgagga ggacagcggt tccaatggca agaaaataac ttgcgaatgc 1080 

78 acgaagcccg atagttaccc tctcttcgac ggtatcttct gctcccatga tgagctttaa 1140 

80 gagctcacc 1149 

83 <210> SEQ ID NO: 2 

84 <211> LENGTH: 383 

85 <212> TYPE: PRT 

86 <213> ORGANISM: Artificial sequence 

88 <220> FEATURE: 

89 <223> OTHER INFORMATION: synthetic 

91 <220> FEATURE: 

92 <221> NAME/KEY: MISC_FEATURE

93 <222> LOCATION: (380)..(380)

94 <223> OTHER INFORMATION: "Xaa" at position 380 represents a stop codon 
97 <400> SEQUENCE: 2 

99 Ala Glu Phe Asp Asn Ile Leu Ser Asp Asn Ile Leu Ser Gly Phe Glu
100 1 5 10 15

103 Asn Glu Tyr Asp Val Ile Tyr Leu Lys Pro Leu Ala Gly Val Tyr Arg
104 20 25 30

107 Ser Leu Lys Lys Gln Ile Glu Lys Asn Ile Phe Thr Phe Asn Leu Asn
108 35 40 45

111 Leu Asn Asp Ile Leu Asn Ser Arg Leu Lys Lys Arg Lys Tyr Phe Leu
112 50 55 60

115 Asp Val Leu Glu Ser Asp Leu Met Gln Phe Lys His Ile Ser Ser Asn
116 65 70 75 80

119 Glu Tyr Ile Ile Glu Asp Ser Phe Lys Leu Leu Asn Ser Glu Gln Lys
120 85 90 95

123 Asn Thr Leu Leu Lys Ser Tyr Lys Tyr Ile Lys Glu Ser Val Glu Asn
124 100 105 110

127 Asp Ile Lys Phe Ala Gln Glu Gly Ile Ser Tyr Tyr Glu Lys Val Leu
128 115 120 125

131 Ala Lys Tyr Lys Asp Asp Leu Glu Ser Ile Lys Lys Val Ile Lys Glu
132 130 135 140

135 Glu Lys Glu Lys Phe Pro Ser Ser Pro Pro Thr Thr Pro Pro Ser Pro
136 145 150 155 160

139 Ala Lys Thr Asp Glu Gln Lys Lys Glu Ser Lys Phe Leu Pro Phe Leu
140 165 170 175

143 Thr Asn Ile Glu Thr Leu Tyr Asn Asn Leu Val Asn Lys Ile Asp Asp
144 180 185 190

147 Tyr Leu Ile Asn Leu Lys Ala Lys Ile Asn Asp Cys Asn Val Glu Lys
148 195 200 205

151 Asp Glu Ala His Val Lys Ile Thr Lys Leu Ser Asp Leu Lys Ala Ile
152 210 215 220

155 Asp Asp Lys Ile Asp Leu Phe Lys Asn His Asn Asp Phe Asp Ala Ile
156 225 230 235 240

159 Lys Lys Leu Ile Asn Asp Asp Thr Lys Lys Asp Met Leu Gly Lys Leu
160 245 250 255

163 Leu Ser Thr Gly Leu Val Gln Asn Phe Pro Asn Thr Ile Ile Ser Lys
164 260 265 270

167 Leu Ile Glu Gly Lys Phe Gln Asp Met Leu Asn Ile Ser Gln His Gln
168 275 280 285

171 Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly Cys Phe Arg His Leu
172 290 295 300

175 Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn Tyr Lys Gln Glu Gly
176 305 310 315 320

179 Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys Asn Glu Asn Asn Gly
180 325 330 335

183 Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu Asp Ser Gly Ser Asn
184 340 345 350

187 Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro Asp Ser Tyr Pro Leu
188 355 360 365

191 Phe Asp Gly Ile Phe Cys Ser His Asp Glu Leu Xaa Glu Leu Thr
192 370 375 380

195 <210> SEQ ID NO: 3
196 <211> LENGTH: 1149
197 <212> TYPE: DNA
198 <213> ORGANISM: Artificial sequence

200 <220> FEATURE:
201 <223> OTHER INFORMATION: synthetic

203 <220> FEATURE:
204 <221> NAME/KEY: sig_peptide
205 <222> LOCATION: (1)..(24)
206 <223> OTHER INFORMATION:

209 <400> SEQUENCE: 3
210 cggatccaag gagatataac aatggacaac atcctcagtg gcttcgagaa cgagtacgac 60

212 gtaatctacc taaagcccct tgccggtgtc taccgttcat tgaagaaaca gatagaaaag 120

214 aatattttca cgttcaacct caacctaaat gacatcctca actcgcgcct caagaagcga 180

216 aaatacttcc tcgacgtgtt ggaatccgac cttatgcaat tcaagcacat tagctctaac 240

218 gagtacatca tagaggacag cttcaagctc ttgaattcag aacagaagaa caccctccta 300

220 aagtcctaca aatacattaa ggagtctgtt gagaacgaca tcaagttcgc ccaggaagga 360

222 attagctact atgagaaagt cctggctaaa tacaaggacg acttggaaag cattaagaag 420

224 gtaatcaaag aagagaagga aaagtttccg agctctccac ccacaactcc cccatcgcct 480

226 gcaaagaccg acgagcagaa aaaagaaagt aagttccttc cattcctcac caacatcgaa 540

228 actctatata acaacctggt gaacaagatt gatgactact taatcaactt gaaggcgaaa 600

230 attaatgact gtaacgtcga aaaggatgaa gcccacgtta agatcaccaa gctttccgat 660

232 ctcaaagcca tcgacgataa gattgacctg tttaagaacc acaacgattt cgacgcaatc 720

234 aaaaagttga tcaacgacga tactaagaaa gacatgcttg gaaaactgct gtcgacaggc 780

236 ttggtccaaa acttcccgaa caccattata agcaagctga tcgaaggaaa gtttcaggat 840

238 atgctgaaca tctctcagca tcaatgcgtg aagaagcaat gtcccgagaa ttcaggttgc 900

240 ttccgccact tagacgaaag ggaggaatgt aaatgcctgc tgaattataa acaggaagga 960

242 gacaagtgcg tagagaatcc taacccaacc tgtaacgaaa ataacggtgg ctgcgatgct 1020

244 gacgctaagt gtaccgagga ggacagcggt tccaatggca agaaaataac ttgcgaatgc 1080

246 acgaagcccg atagttaccc tctcttcgac ggtatcttct gctcccatga tgagctttaa 1140

248 gagctcacc 1149

251 <210> SEQ ID NO: 4
252 <211> LENGTH: 383
253 <212> TYPE: PRT
254 <213> ORGANISM: Artificial sequence

256 <220> FEATURE:
257 <223> OTHER INFORMATION: synthetic

259 <220> FEATURE:
260 <221> NAME/KEY: MISC_FEATURE
261 <222> LOCATION: (380)..(380)
262 <223> OTHER INFORMATION: "Xaa" at position 380 represents a stop

265 <400> SEQUENCE: 4

267 Arg Ile Gln Gly Asp Ile Thr Met Asp Asn Ile Leu Ser Gly Phe Glu
268 1 5 10 15

271 Asn Glu Tyr Asp Val Ile Tyr Leu Lys Pro Leu Ala Gly Val Tyr Arg
272 20 25 30

275 Ser Leu Lys Lys Gln Ile Glu Lys Asn Ile Phe Thr Phe Asn Leu Asn
276 35 40 45

279 Leu Asn Asp Ile Leu Asn Ser Arg Leu Lys Lys Arg Lys Tyr Phe Leu
280 50 55 60

283 Asp Val Leu Glu Ser Asp Leu Met Gln Phe Lys His Ile Ser Ser Asn
284 65 70 75 80

287 Glu Tyr Ile Ile Glu Asp Ser Phe Lys Leu Leu Asn Ser Glu Gln Lys
288 85 90 95

291 Asn Thr Leu Leu Lys Ser Tyr Lys Tyr Ile Lys Glu Ser Val Glu Asn
292 100 105 110

295 Asp Ile Lys Phe Ala Gln Glu Gly Ile Ser Tyr Tyr Glu Lys Val Leu
296 115 120 125

299 Ala Lys Tyr Lys Asp Asp Leu Glu Ser Ile Lys Lys Val Ile Lys Glu
300 130 135 140

303 Glu Lys Glu Lys Phe Pro Ser Ser Pro Pro Thr Thr Pro Pro Ser Pro
304 145 150 155 160

307 Ala Lys Thr Asp Glu Gln Lys Lys Glu Ser Lys Phe Leu Pro Phe Leu
308 165 170 175

311 Thr Asn Ile Glu Thr Leu Tyr Asn Asn Leu Val Asn Lys Ile Asp Asp
312 180 185 190

315 Tyr Leu Ile Asn Leu Lys Ala Lys Ile Asn Asp Cys Asn Val Glu Lys
316 195 200 205

319 Asp Glu Ala His Val Lys Ile Thr Lys Leu Ser Asp Leu Lys Ala Ile
320 210 215 220

323 Asp Asp Lys Ile Asp Leu Phe Lys Asn His Asn Asp Phe Asp Ala Ile
324 225 230 235 240

327 Lys Lys Leu Ile Asn Asp Asp Thr Lys Lys Asp Met Leu Gly Lys Leu
328 245 250 255

331 Leu Ser Thr Gly Leu Val Gln Asn Phe Pro Asn Thr Ile Ile Ser Lys
332 260 265 270

335 Leu Ile Glu Gly Lys Phe Gln Asp Met Leu Asn Ile Ser Gln His Gln
336 275 280 285

339 Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly Cys Phe Arg His Leu
340 290 295 300

343 Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn Tyr Lys Gln Glu Gly
344 305 310 315 320

347 Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys Asn Glu Asn Asn Gly
348 325 330 335

351 Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu Asp Ser Gly Ser Asn
352 340 345 350

355 Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro Asp Ser Tyr Pro Leu
356 355 360 365

W--> 359 Phe Asp Gly Ile Phe Cys Ser His Asp Glu Leu Xaa Glu Leu Thr
360 370 375 380

363 <210> SEQ ID NO: 5 

364 <211> LENGTH: 38

365 <212> TYPE: DNA 

366 <213> ORGANISM: Artificial sequence 

368 <220> FEATURE: 

369 <223> OTHER INFORMATION: synthetic 

371 <400> SEQUENCE: 5 

372 cgggatccca aggagatacc atggcagtaa ctccttcc 38 

375 <210> SEQ ID NO: 6 

376 <211> LENGTH: 42 

377 <212> TYPE: DNA 

378 <213> ORGANISM: Artificial sequence 

380 <220> FEATURE: 

381 <223> OTHER INFORMATION: synthetic 

383 <400> SEQUENCE: 6 

384 ctgcgagctc ttattaaagc tcatcatgct gcagaaaata cc 42 

387 <210> SEQ ID NO: 7 

388 <211> LENGTH: 28 

389 <212> TYPE: DNA 

390 <213> ORGANISM: Artificial sequence 

392 <220> FEATURE: 

393 <223> OTHER INFORMATION: synthetic 

395 <400> SEQUENCE: 7 

396 ctgcgagctc ttattaatga tgatgatg 28 

399 <210> SEQ ID NO: 8 

400 <211> LENGTH: 42 

401 <212> TYPE: DNA 

402 <213> ORGANISM: Artificial sequence 

404 <220> FEATURE: 

405 <223> OTHER INFORMATION: synthetic 

407 <400> SEQUENCE: 8 

408 ggtgagctct taaagctcat catgggagca gaagataccg tc 42 

411 <210> SEQ ID NO: 9 

412 <211> LENGTH: 24 

413 <212> TYPE: DNA 

414 <213> ORGANISM: Artificial sequence 

416 <220> FEATURE: 

417 <223> OTHER INFORMATION: synthetic 

419 <400> SEQUENCE: 9 

420 gccgaattcg acaacatcct cagt 24 

423 <210> SEQ ID NO: 10 

424 <211> LENGTH: 42 

425 <212> TYPE: DNA

PATENT APPLICATION: US/10/098,514 TIME: 15:01:02

Input Set : A:\PTO.VSK.txt

Please Note:

Use of n and/or Xaa have been detected in the Sequence Listing. Please review
Sequence Listing to ensure that a corresponding explanation is presented in the <220>
to <223> fields of each sequence which presents at least one n or Xaa.

Seq#:2; Xaa Pos. 380 
Seq#:4; Xaa Pos. 380

L:15 M:271 C: Current Filing Date differs, Replaced Current Filing Date
L:191 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:2 after pos.:368 
L:359 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:4 after pos.:368 